Homophily and Latent Attribute Inference: Inferring Latent Attributes of Twitter Users from Neighbors
نویسندگان
چکیده
In this paper, we extend existing work on latent attribute inference by leveraging the principle of homophily: we evaluate the inference accuracy gained by augmenting the user features with features derived from the Twitter profiles and postings of her friends. We consider three attributes which have varying degrees of assortativity: gender, age, and political affiliation. Our approach yields a significant and robust increase in accuracy for both age and political affiliation, indicating that our approach boosts performance for attributes with moderate to high assortativity. Furthermore, different neighborhood subsets yielded optimal performance for different attributes, suggesting that different subsamples of the user’s neighborhood characterize different aspects of the user herself. Finally, inferences using only the features of a user’s neighbors outperformed those based on the user’s features alone. This suggests that the neighborhood context alone carries substantial information about the user.
منابع مشابه
Inferring User Preferences by Probabilistic Logical Reasoning over Social Networks
We propose a framework for inferring the latent attitudes or preferences of users by performing probabilistic first-order logical reasoning over the social network graph. Our method answers questions about Twitter users like Does this user like sushi? or Is this user a New York Knicks fan? by building a probabilistic model that reasons over user attributes (the user’s location or gender) and th...
متن کاملLearning multi-faceted representations of individuals from heterogeneous evidence using neural networks
Inferring latent attributes of people online is an important social computing task, but requires integrating the many heterogeneous sources of information available on the web. We propose to learn individual representations of people using neural nets to integrate information from social media. The algorithm is able to combine any kind of cues, such as the text a person writes, the person’s att...
متن کاملClassifying Political Orientation on Twitter: It's Not Easy!
Numerous papers have reported great success at inferring the political orientation of Twitter users. This paper has some unfortunate news to deliver: while past work has been sound and often methodologically novel, we have discovered that reported accuracies have been systemically overoptimistic due to the way in which validation datasets have been collected, reporting accuracy levels nearly 30...
متن کاملGender Inference of Twitter Users in Non-English Contexts
While much work has considered the problem of latent attribute inference for users of social media such as Twitter, little has been done on non-English-based content and users. Here, we conduct the first assessment of latent attribute inference in languages beyond English, focusing on gender inference. We find that the gender inference problem in quite diverse languages can be addressed using e...
متن کاملControlling for Latent Homophily in Social Networks through Inferring Latent Locations
Social influence cannot be identified from purely observational data on social networks, because such influence is generically confounded with latent homophily, i.e., with a node’s network partners being informative about the node’s attributes and therefore its behavior. We show that if the network grows according to either a community (stochastic block) model, or a continuous latent space mode...
متن کامل